135 research outputs found

    mbs: modifying Hudson's ms software to generate samples of DNA sequences with a biallelic site under selection

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The pattern of single nucleotide polymorphisms, or SNPs, contains a tremendous amount of information with respect to the mechanisms of the micro-evolutionary process of a species. The inference of the roles of these mechanisms, including natural selection, relies heavily on computer simulations. A coalescent simulation is extremely powerful in generating a large number of samples of DNA sequences from a population (species) when all mutations are neutral, and Hudson's <b>ms </b>software is frequently used for this purpose.</p> <p>However, it has been difficult to incorporate natural selection into the coalescent framework.</p> <p>Results</p> <p>We herein present a software application to generate samples of DNA sequences when there is a biallelic site targeted by selection. This software application, referred to as <b>mbs</b>, is developed by modifying Hudson's <b>ms</b>. The <b>mbs </b>software is so flexible that it can incorporate any arbitrary histories of population size changes and any mode of selection as long as selection is operating on a biallelic site.</p> <p>Conclusion</p> <p><b>mbs </b>provides opportunities to investigate the effect of any mode of selection on the pattern of SNPs under various demography.</p

    Gene expression drives the evolution of dominance.

    Get PDF
    Dominance is a fundamental concept in molecular genetics and has implications for understanding patterns of genetic variation, evolution, and complex traits. However, despite its importance, the degree of dominance in natural populations is poorly quantified. Here, we leverage multiple mating systems in natural populations of Arabidopsis to co-estimate the distribution of fitness effects and dominance coefficients of new amino acid changing mutations. We find that more deleterious mutations are more likely to be recessive than less deleterious mutations. Further, this pattern holds across gene categories, but varies with the connectivity and expression patterns of genes. Our work argues that dominance arises as a consequence of the functional importance of genes and their optimal expression levels

    Extensive Copy-Number Variation of Young Genes across Stickleback Populations

    Get PDF
    MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

    Duplication and Gene Conversion in the Drosophila melanogaster Genome

    Get PDF
    Using the genomic sequences of Drosophila melanogaster subgroup, the pattern of gene duplications was investigated with special attention to interlocus gene conversion. Our fine-scale analysis with careful visual inspections enabled accurate identification of a number of duplicated blocks (genomic regions). The orthologous parts of those duplicated blocks were also identified in the D. simulans and D. sechellia genomes, by which we were able to clearly classify the duplicated blocks into post- and pre-speciation blocks. We found 31 post-speciation duplicated genes, from which the rate of gene duplication (from one copy to two copies) is estimated to be 1.0×10−9 per single-copy gene per year. The role of interlocus gene conversion was observed in several respects in our data: (1) synonymous divergence between a duplicated pair is overall very low. Consequently, the gene duplication rate would be seriously overestimated by counting duplicated genes with low divergence; (2) the sizes of young duplicated blocks are generally large. We postulate that the degeneration of gene conversion around the edges could explain the shrinkage of “identifiable” duplicated regions; and (3) elevated paralogous divergence is observed around the edges in many duplicated blocks, supporting our gene conversion–degeneration model. Our analysis demonstrated that gene conversion between duplicated regions is a common and genome-wide phenomenon in the Drosophila genomes, and that its role should be especially significant in the early stages of duplicated genes. Based on a population genetic prediction, we applied a new genome-scan method to test for signatures of selection for neofunctionalization and found a strong signature in a pair of transporter genes

    Fine-scale detection of population-specific linkage disequilibrium using haplotype entropy in the human genome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The creation of a coherent genomic map of recent selection is one of the greatest challenges towards a better understanding of human evolution and the identification of functional genetic variants. Several methods have been proposed to detect linkage disequilibrium (LD), which is indicative of natural selection, from genome-wide profiles of common genetic variations but are designed for large regions.</p> <p>Results</p> <p>To find population-specific LD within small regions, we have devised an entropy-based method that utilizes differences in haplotype frequency between populations. The method has the advantages of incorporating multilocus association, conciliation with low allele frequencies, and independence from allele polarity, which are ideal for short haplotype analysis. The comparison of HapMap SNPs data from African and Caucasian populations with a median resolution size of ~23 kb gave us novel candidates as well as known selection targets. Enrichment analysis for the yielded genes showed associations with diverse diseases such as cardiovascular, immunological, neurological, and skeletal and muscular diseases. A possible scenario for a selective force is discussed. In addition, we have developed a web interface (ENIGMA, available at <url>http://gibk21.bse.kyutech.ac.jp/ENIGMA/index.html</url>), which allows researchers to query their regions of interest for population-specific LD.</p> <p>Conclusion</p> <p>The haplotype entropy method is powerful for detecting population-specific LD embedded in short regions and should contribute to further studies aiming to decipher the evolutionary histories of modern humans.</p

    A revised evolutionary history of the CYP1A subfamily : gene duplication, gene conversion, and positive selection

    Get PDF
    Author Posting. © The Authors, 2005. This is the author's version of the work. It is posted here by permission of Springer for personal use, not for redistribution. The definitive version was published in Journal of Molecular Evolution 62 (2006): 708-717, doi:10.1007/s00239-005-0134-z.Members of cytochrome P450 subfamily 1A (CYP1As) are involved in detoxification and bioactivation of common environmental pollutants. Understanding the functional evolution of these genes is essential to predicting and interpreting species differences in sensitivity to toxicity by such chemicals. The CYP1A gene subfamily comprises a single ancestral representative in most fish species and two paralogs in higher vertebrates, including birds and mammals. Phylogenetic analysis of complete coding sequences suggests that mammalian and bird paralog pairs (CYP1A1/2 and CYP1A4/5, respectively) are the result of independent gene duplication events. However, comparison of vertebrate genome sequences revealed that CYP1A genes lie within an extended region of conserved fine-scale synteny, suggesting that avian and mammalian CYP1A paralogs share a common genomic history. Algorithms designed to detect recombination between nucleotide sequences indicate that gene conversion has homogenized most of the length of the chicken CYP1A genes, as well as the 5’ end of mammalian CYP1As. Together, these data indicate that avian and mammalian CYP1A paralog pairs resulted from a single gene duplication event and that extensive gene conversion is responsible for the exceptionally high degree of sequence similarity between CYP1A4 and CYP1A5. Elevated non-synonymous/synonymous substitution ratios within a putatively unconverted stretch of ~250 bp suggests that positive selection may have reduced the effective rate of gene conversion in this region, which contains two substrate recognition sites. This work significantly alters our understanding of functional evolution in the CYP1A subfamily, suggesting that gene conversion and positive selection have been the dominant processes of sequence evolution.Funding for this work was provided by the NIH Superfund Basic Research Program at Boston University (5-P42-ES-07381) and by the Woods Hole Oceanographic Institution

    Genetic hitchhiking and resistance evolution to transgenic Bt toxins: insights from the African stalk borer Busseola fusca (Noctuidae)

    Get PDF
    Since transgenic crops expressing Bacillus thuringiensis (Bt) toxins were first released, resistance evolution leading to failure in control of pests populations has been observed in a number of species. Field resistance of the moth Busseola fusca was acknowledged 8 years after Bt maize was introduced in South Africa. Since then, field resistance of this corn borer has been observed at several locations, raising questions about the nature, distribution and dynamics of the resistance trait. Using genetic markers, our study identified four outlier loci clearly associated with resistance. In addition, genetic structure at neutral loci reflected extensive gene flow among populations. A realistically parameterised model suggests that resistance could travel in space at speed of several kilometres a year. Markers at outlier loci delineated a geographic region associated with resistance spread. This was an area of approximately 100 km radius, including the location where resistance was first reported. Controlled crosses corroborated these findings and showed significant differences of progeny survival on Bt plants depending on the origin of the resistant parent. Last, our study suggests diverse resistance mutations, which would explain the widespread occurrence of resistant larvae in Bt fields across the main area of maize production in South Africa

    Widespread Gene Conversion of Alpha-2-Fucosyltransferase Genes in Mammals

    Get PDF
    The alpha-2-fucosyltransferases (α2FTs) are enzymes involved in the biosynthesis of α2fucosylated glycan structures. In mammalian genomes, there are three α2FT genes located in tandem—FUT1, FUT2, and Sec1—each contained within a single exon. It has been suggested that these genes originated from two successive duplications, with FUT1 being generated first and FUT2 and Sec1 second. Despite gene conversion being considered the main mechanism of concerted evolution in gene families, previous studies of primates α2FTs failed to detect it, although the occurrence of gene conversion between FUT2 and Sec1 was recently reported in a human allele. The primary aim of our work was to initiate a broader study on the molecular evolution of mammalian α2FTs. Sequence comparison leads us to confirm that the three genes appeared by two rounds of duplication. In addition, we were able to detect multiple gene-conversion events at the base of primates and within several nonprimate species involving FUT2 and Sec1. Gene conversion involving FUT1 and either FUT2 or Sec1 was also detected in rabbit. The extent of gene conversion between the α2FTs genes appears to be species-specific, possibly related to functional differentiation of these genes. With the exception of rabbits, gene conversion was not observed in the region coding the C-terminal part of the catalytic domain. In this region, the number of amino acids that are identical between FUT1 and FUT2, but different in Sec1, is higher than in other parts of the protein. The biologic meaning of this observation may be related to functional constraints

    A high resolution genome-wide scan for significant selective sweeps: an application to pooled sequence data in laying chickens

    Get PDF
    In most studies aimed at localizing footprints of past selection, outliers at tails of the empirical distribution of a given test statistic are assumed to reflect locus-specific selective forces. Significance cutoffs are subjectively determined, rather than being related to a clear set of hypotheses. Here, we define an empirical p-value for the summary statistic by means of a permutation method that uses the observed SNP structure in the real data. To illustrate the methodology, we applied our approach to a panel of 2.9 million autosomal SNPs identified from re-sequencing a pool of 15 individuals from a brown egg layer line. We scanned the genome for local reductions in heterozygosity, suggestive of selective sweeps. We also employed a modified sliding window approach that accounts for gaps in the sequence and increases scanning resolution by moving the overlapping windows by steps of one SNP only, and suggest to call this a "creeping window" strategy. The approach confirmed selective sweeps in the region of previously described candidate genes, i.e. TSHR, PRL, PRLHR, INSR, LEPR, IGF1, and NRAMP1 when used as positive controls. The genome scan revealed 82 distinct regions with strong evidence of selection (genome-wide p-value<0.001), including genes known to be associated with eggshell structure and immune system such as CALB1 and GAL cluster, respectively. A substantial proportion of signals was found in poor gene content regions including the most extreme signal on chromosome 1. The observation of multiple signals in a highly selected layer line of chicken is consistent with the hypothesis that egg production is a complex trait controlled by many genes
    corecore